Discriminative Reordering Model for Machine Translation
نویسنده
چکیده
We have built a discriminative reordering model for the phrase-based machine translation system Moses, which is developed at the University of Edinburgh. The model is a maximum entropy classifier which incorporates a variety of feature functions to predict phrase orientation for machine translation. Two kinds of features reported in literature, namely lexical features and dependency path feature have been tested in the discriminative model. We have also proposed and tested a novel feature named dependency orientation and modified the dependency path feature with lexicalization. Two baseline models are used in evaluation, namely, distance-based model without lexicalized reordering, and the lexicalized reordering model. We are able to achieve significant BLEU gains over the distance based model by up to 0.95 absolute points, and BLEU gains over the lexicalized reordering model by up to 0.74 absolute points. Discriminative reordering model is a very generic framework and is open to many more features for further improvement.
منابع مشابه
Discriminative Reordering Extensions for Hierarchical Phrase-Based Machine Translation
In this paper, we propose novel extensions of hierarchical phrase-based systems with a discriminative lexicalized reordering model. We compare different feature sets for the discriminative reordering model and investigate combinations with three types of non-lexicalized reordering rules which are added to the hierarchical grammar in order to allow for more reordering flexibility during decoding...
متن کاملDiscriminative Reordering Model Adaptation via Structural Learning
Reordering model adaptation remains a big challenge in statistical machine translation because reordering patterns of translation units often vary dramatically from one domain to another. In this paper, we propose a novel adaptive discriminative reordering model (DRM) based on structural learning, which can capture correspondences among reordering features from two different domains. Exploiting...
متن کاملDynamic distortion in a discriminative reordering model for statistical machine translation
Most phrase-based statistical machine translation systems use a so-called distortion limit to keep the size of the search space manageable. In addition, a distance-based distortion penalty is used as a feature to keep the decoder to translate monotonically unless there is sufficient support for a jump from other features, particularly the language models. To overcome the issue of setting the op...
متن کاملLarge-scale Reordering Model for Statistical Machine Translation using Dual Multinomial Logistic Regression
Phrase reordering is a challenge for statistical machine translation systems. Posing phrase movements as a prediction problem using contextual features modeled by maximum entropy-based classifier is superior to the commonly used lexicalized reordering model. However, Training this discriminative model using large-scale parallel corpus might be computationally expensive. In this paper, we explor...
متن کاملInducing a Discriminative Parser to Optimize Machine Translation Reordering
This paper proposes a method for learning a discriminative parser for machine translation reordering using only aligned parallel text. This is done by treating the parser’s derivation tree as a latent variable in a model that is trained to maximize reordering accuracy. We demonstrate that efficient large-margin training is possible by showing that two measures of reordering accuracy can be fact...
متن کامل